SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: MURPHY, Dennis 

REID, John

(ii) TITLE OF INVENTION: Alpha Glycosidase 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

CECCHI, STEWART & OLSTEIN 

(B) STREET: 6 BECKER FARM ROAD 

(C) CITY: ROSELAND 

(D) STATE: NEW JERSEY 

(E) COUNTRY: USA 

(F) ZIP: 07068 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 INCH DISKETTE 

(B) COMPUTER: IBM PS/2 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: ASCII 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Unassigned 

(B) FILING DATE: Concurrently 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: HERRON, CHARLES J. 

(B) REGISTRATION NUMBER: 28,019 

(C) REFERENCE/DOCKET NUMBER: 331400-40

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-894-1700 

(B) TELEFAX: 201-994-1744 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAGAGCG CTCGTCTTTC AC 



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CGGAAGATCT AGGTTCCCCA TTTTCACCCC T 

(2) INFORMATION FOR SEQ ID NO: 3: 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
«C SCO AGT »C CIO ATA Ss

Gly Ile Ala Ser Asp Leu Ile Glu Ile Ile Gly



65 



GCA ATA CTC CCC CTC CTC CCG CTT AGC ???
Ala Ile Leu Pro Leu Leu Pro Leu Ser Arg
85

TTC TGG CTG CCA GAG CTC GCC TAT GAC CCG ATA ATC CCT GCC ATA CTG 384
Phe Trp Leu Pro Glu Leu Ala Tyr Asp Pro Ile Ile Pro Ala Ile Leu
115 120 125 



AAG GAC AAC GGT TAT GAG TAT CTA TTC GCC GAC GGG GAG GCG ATG CTT 432 
Lys Asp Asn Gly Tyr Glu Tyr Leu Phe Ala Asp Gly Glu Ala Met Leu
130 135 140 

TTC TCA GCT CAT CTC AAC TCG GCG ATA AAG CCA ATT AAA CCG CTC TAT 480 
Phe Ser Ala His Leu Asn Ser Ala Ile Lys Pro Ile Lys Pro Leu Tyr
145 150 155 160 

CCA CAC CTT ATA AAG GCC CAA AGG GAA AAG CGC TTT AGG TAG ATC AGC 528 
Pro His Leu Ile Lys Ala Gln Arg Glu Lys Arg Phe Arg Tyr Ile Ser
165 170 175 

TAT CTC CTT GGT CTC AGG GAG CTT AGG AAG GCG ATA AAG CTC GTT TTT 576 
Tyr Leu Leu Gly Leu Arg Glu Leu Arg Lys Ala Ile Lys Leu Val Phe
180 185 190

GAA GGT AAG GTA ACG CTA AAG GCA GTC AAA GAC ATC GAA GCC GTA CCC 624 
Glu Gly Lys Val Thr Leu Lys Ala Val Lys Asp Ile Glu Ala Val Pro
195 200 205

GTT TGG GTG GCC GTG AAC ACG GCT GTA ATG CTC GGC ATC GGA AGG CTT 672 
Val Trp Val Ala Val Asn Thr Ala Val Met Leu Gly Ile Gly Arg Leu
210 215 220 

CCT CTT ATG AAT CCT AAG AAA GTG GCG AGC TGG ATA GAG GAC AAG GAC 720 
Pro Leu Met Asn Pro Lys Lys Val Ala Ser Trp Ile Glu Asp Lys Asp
225 230 235 240 

AAC ATT CTT CTA TAG GGC ACC GAT ATA GAG TTC ATT GGC TAT AGG GAC 768 
Asn Ile Leu Leu Tyr Gly Thr Asp Ile Glu Phe Ile Gly Tyr Arg Asp
245 250 255 

ATT GCA GGC TAC AGA ATG AGT GTT GAG GGA TTA TTA GAG GTT ATA GAC 816 
Ile Ala Gly Tyr Arg Met Ser Val Glu Gly Leu Leu Glu Val Ile Asp
260 265 270 

GAG CTC AAC TCG GAA CTG TGC CTT CCC TCA GAG CTG AAG CAC AGT GGA 864 
Glu Leu Asn Ser Glu Leu Cys Leu Pro Ser Glu Leu Lys His Ser Gly
275 280 285 

AGG GAG CTC TAC TTA CGG ACT TCG AGT TGG GCA CCA GAT AAG AGC TTG 912 
Arg Glu Leu Tyr Leu Arg Thr Ser Ser Trp Ala Pro Asp Lys Ser Leu 
290 295 300 
AGG ATA TGG AGA GA? GA? GAA GGG AAC GCA AGA CTT AAT ATG CTG TCC 960
Arg Ile Trp Arg Glu Asp Glu Gly Asn Ala Arg Leu Asn Met Leu Ser



TAG AAT ATG AGG GGC GAA CTC GCC CTT TTA GCC GAG AAC AGC GAT GCA 
Tyr Asn Met Arg Gly Glu Leu Ala Phe Leu Ala Glu Asn Ser Asp Ala

325 330 



ATA TAT AAC GAT TGG AGG GGT GAA AAT GGG GAA CCT TAG
Ile Tyr Asn Asp Trp Arg Gly Glu Asn Gly Glu Pro End

355 360 365 



(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 364 AMINO ACIDS 

(B) TYPE: POLYPEPTIDE 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Leu Arg Ala Leu Val Phe His Gly Asn Leu Gln Tyr Ala Glu Ile

5 10 15 



Phe Lys Ser Glu Ile Pro Lys Val Ile Glu Lys Ala Tyr Ile Pro

20 25 30



Val Ile Glu Thr Leu Ile Lys Glu Gln Ile Pro Phe Gly Leu Asn
35 40 45 

Ile Thr Gly Tyr Thr Leu Lys Phe Leu Pro Lys Asn Ile Ile Asp
50 55 60 



Leu Val Lys Gly Gly Ile Ala Ser Asp Leu Ile Glu Ile Ile Gly
65 70 75 

Thr Ser Tyr Tyr His Ala Ile Leu Pro Leu Leu Pro Leu Ser Arg
80 85 90



Val Glu Ala Glu Val Gln Arg Asp Arg Glu Val Lys Glu Glu Leu
95 100 105



Phe Glu Val Ser Pro Lys Gly Phe Trp Leu Pro Glu Leu Ala Tyr 

110 115 120

Asn Pro Ile Ile Phe Ala Ile Leu Lys Asp Asn Gly Tyr Glu Tyr
125 130 135 

Leu Phe Ala Asp Gly Glu Ala Met Leu Phe Ser Ala His Leu Asn 
140 145 150
AG? GGA TGG GAG CCC CTC CCT GAG AGG AGG CTG GAT GCC TTC CGG GCG 1047
Arg Gly Trp Glu Pro Leu Pro Glu Arg Arg Leu Asp Ala Phe Arg Ala
340 345 350



1086 



Ser Ala Ile Lys Pro Ile Lys Pro Leu Tyr Pro His Leu Ile Lys
155 160 165 

Ala Gln Arg Glu Lys Arg Phe Arg Tyr Ile Ser Tyr Leu Leu Gly
170 175 180 

Leu Arg Glu Leu Arg Lys Ala Ile Lys Leu Val Phe Glu Gly Lys
185 190 195

Val Thr Leu Lys Ala Val Lys Asp Ile Glu Ala Val Pro Val Trp
200 205 210 



Val Ala Val Asn Thr Ala Val Met Leu Gly Ile Gly Arg Leu Pro
215 220 225 

Leu Met Asn Pro Lys Lys Val Ala Ser Trp Ile Glu Asp Lys Asp
230 235 240 

Asn Ile Leu Leu Tyr Gly Thr Asp Ile Glu Phe Ile Gly Tyr Arg
245 250 255 

Asp Ile Ala Gly Tyr Arg Met Ser Val Glu Gly Leu Leu Glu Val
260 265 270 

Ile Asp Glu Leu Asn Ser Glu Leu Cys Leu Pro Ser Glu Leu Lys
275 280 285



His Ser Gly Arg Glu Leu Tyr Leu Arg Thr Ser Ser Trp Ala Pro
290 295 300 

Asp Lys Ser Leu Arg Ile Trp Arg Glu Asp Glu Gly Asn Ala Arg
305 310 315 

Leu Asn Met Leu Ser Tyr Asn Met Arg Gly Glu Leu Ala Phe Leu 
320 325 330 

Ala Glu Asn Ser Asp Ala Arg Gly Trp Glu Pro Leu Pro Gln Arg
335 340 345 

Arg Leu Asp Ala Phe Arg Ala Ile Tyr Asn Asp Trp Arg Gly Glu
350 355 360

Asn Gly Glu Pro 